Parikh's Theorem and Descriptional Complexity
نویسندگان
چکیده
This thesis was carried out in the Laboratorio di Linguaggi e Combinatoria (LIN.COM), at the Dipartimento di Informatica e Comunicazione, Università degli Studi di Milano. The thesis deals with some topics in the theory of formal languages, and specifically with the theory of context-free languages and the study of theirs descriptional complexity. The descriptional complexity of a formal structure (grammar, model of automata, etc) is the number of symbols needed to write down its description. While this aspect is extensively treated in regular languages, as evidenced by numerous references, in the case of context-free languages few results are known. The thesis starts studying the Parikh’s theorem. The theorem states that for each context-free language there exists a regular language with the same Parikh image. The Parikh image of a language L ⊆ Σ∗ is the set of Parikh images of its words. Given an alphabet Σ = {a1, . . . , am}, the Parikh image is a function ψ : Σ∗ → N m that associates with each word w ∈ Σ∗, the vector ψ(w) = (|w|a1 , |w|a2 , ..., |w|am), where |w|ai is the number of occurrences of ai in w. For instance, the language {anbn | n ≥ 0} has the same Parikh image as (ab)∗. Roughly speaking, the theorem shows that if the order of the letters in a word is disregarded, retaining only the number of their occurrences, then context-free languages are indistinguishable from regular languages. The classical proofs of Parikh’s theorems implicitly provide the construction of a nondeterministic automaton accepting a language “Parikh-equivalent” to the context-free language under consideration. The number of the states of such an automaton is double exponential in the size of the given context-free grammar. In
منابع مشابه
Parikh's Theorem Does Not Hold for Multiplicities
We consider the question of whether the famous Parikh's theorem holds with multiplicities i.e., for formal power series instead of languages. We introduce two families of semilinear formal power series over a commutative product monoid. The strict hierarchy of algebraic, rational, recognizable and semi-linear formal power series is proved and in this way it is established that the Parikh's theo...
متن کاملProving Parikh's theorem using Chomsky-Schutzenberger theorem
Parikh theorem was originally stated and proved by Rohkit Parikh in MIT research report in 1961. Many different proofs of this classical theorems were produced then; our goal is to give another proof using Chomsky-Schutzenberger representation theorem. We present the proof which doesn't use any formal language theory tool at all except the representation theorem, just some linear algebra.
متن کاملA Fully Equational Proof of Parikh's Theorem
We show that the validity of Parikh’s theorem for context-free languages depends only on a few equational properties of least pre-fixed points. Moreover, we exhibit an infinite basis of μ-term equations of continuous commutative idempotent semirings. AMS Subject Classification (2000): 03C05, 16Y60, 68Q42, 68Q45, 68Q70.
متن کاملExtending Parikh's Theorem to Weighted and Probabilistic Context-Free Grammars
We prove an analog of Parikh’s theorem for weighted context-free grammars over commutative, idempotent semirings, and exhibit a stochastic context-free grammar with behavior that cannot be realized by any stochastic right-linear context-free grammar. Finally, we show that every unary stochastic context-free grammar with polynomially-bounded ambiguity has an equivalent stochastic right-linear co...
متن کاملOn semilinearity in formal power series
A notion of semilinearity is introduced for formal power series as a natural generalization of semilinear languages over a commutative monoid. Some closure properties of the semilinear languages are established also for formal power series. In this way, a classical result due to Eilenberg and Sch utzenberger and some results due to Ginsburg are extended to series. We also prove that Parikh's t...
متن کامل